AITopics

2512.07519

Country:

Europe (0.46)
North America > United States > Wisconsin (0.14)

Genre: Research Report (0.50)

Industry:

Leisure & Entertainment > Games > Chess (1.00)
Health & Medicine > Therapeutic Area (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.96)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.89)

arXiv.org Artificial IntelligenceDec-16-2024

Can LLM Prompting Serve as a Proxy for Static Analysis in Vulnerability Detection

Ceka, Ira, Qiao, Feitong, Dey, Anik, Valechia, Aastha, Kaiser, Gail, Ray, Baishakhi

Despite their remarkable success, large language models (LLMs) have shown limited ability on applied tasks such as vulnerability detection. We investigate various prompting strategies for vulnerability detection and, as part of this exploration, propose a prompting strategy that integrates natural language descriptions of vulnerabilities with a contrastive chain-of-thought reasoning approach, augmented using contrastive samples from a synthetic dataset. Our study highlights the potential of LLMs to detect vulnerabilities by integrating natural language descriptions, contrastive reasoning, and synthetic examples into a comprehensive prompting framework. Our results show that this approach can enhance LLM understanding of vulnerabilities. On a high-quality vulnerability detection dataset such as SVEN, our prompting strategies can improve accuracies, F1-scores, and pairwise accuracies by 23%, 11%, and 14%, respectively.

large language model, machine learning, natural language, (20 more...)

2412.12039

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
Europe > Switzerland > Basel-City > Basel (0.04)
Asia > Nepal (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Visani, Giorgio, Stanzione, Vincenzo, Garreau, Damien

GLEAMS: Bridging the Gap Between Local and Global Explanations

arXiv.org Artificial IntelligenceAug-9-2024

The explainability of machine learning algorithms is crucial, and numerous methods have emerged recently. Local, post-hoc methods assign an attribution score to each feature, indicating its importance for the prediction. However, these methods require recalculating explanations for each example. On the other side, while there exist global approaches they often produce explanations that are either overly simplistic and unreliable or excessively complex. To bridge this gap, we propose GLEAMS, a novel method that partitions the input space and learns an interpretable model within each sub-region, thereby providing both faithful local and global surrogates. We demonstrate GLEAMS' effectiveness on both synthetic and real-world data, highlighting its desirable properties and human-understandable insights.

attribution, explanation, gleam, (17 more...)

2408.0506

Country:

Europe > Italy > Emilia-Romagna > Metropolitan City of Bologna > Bologna (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)
Europe > Germany > Bavaria > Lower Franconia > Würzburg (0.04)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (0.93)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.68)

Neural Information Processing SystemsMar-13-2024, 17:01:23 GMT

6081594975a764c8e3a691fa2b3a321d-Reviews.html

This paper proposes a new boosting method that represents a tradeoff between online and offline learning. The main idea of the method is to maintain a reservoir of training examples (of fixed size) from which to train the weak learners. At each boosting iteration, new examples are added to the reservoir and then a selection strategy is used to reduce the reservoir to its original fixed size before the weak learner is trained. Several naive selection strategies are proposed but the main contribution of the paper is a more sophisticated selection strategy whose goal is to remove examples from the reservoir so that a weak learner trained on the reduced set will minimize the error computed on the whole set before reduction. The resulting algorithm is applied on four computer vision datasets, where it is shown to outperform several other online boosting methods. The idea of using a reservoir is original and very interesting.

algorithm, reservoir, weak learner, (15 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.55)

Neural Information Processing SystemsMar-13-2024, 16:31:37 GMT

One-shot learning by inverting a compositional causal process

People can learn a new visual class from just one example, yet machine learning algorithms typically require hundreds or thousands of examples to tackle the same problems. Here we present a Hierarchical Bayesian model based on compositionality and causality that can learn a wide range of natural (although simple) visual concepts, generalizing in human-like ways from just one image. We evaluated performance on a challenging one-shot classification task, where our model achieved a human-level error rate while substantially outperforming two deep learning models. We also tested the model on another conceptual task, generating new examples, by using a "visual Turing test" to show that our model produces human-like performance.

hbpl, new example, participant, (11 more...)

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > New York (0.04)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.87)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.34)

arXiv.org Artificial IntelligenceAug-9-2023

Adversarial Word Dilution as Text Data Augmentation in Low-Resource Regime

Chen, Junfan, Zhang, Richong, Luo, Zheyan, Hu, Chunming, Mao, Yongyi

Data augmentation is widely used in text classification, especially in the low-resource regime where a few examples for each class are available during training. Despite the success, generating data augmentations as hard positive examples that may increase their effectiveness is under-explored. This paper proposes an Adversarial Word Dilution (AWD) method that can generate hard positive examples as text data augmentations to train the low-resource text classification model efficiently. Our idea of augmenting the text data is to dilute the embedding of strong positive words by weighted mixing with unknown-word embedding, making the augmented inputs hard to be recognized as positive by the classification model. We adversarially learn the dilution weights through a constrained min-max optimization process with the guidance of the labels. Empirical studies on three benchmark datasets show that AWD can generate more effective data augmentations and outperform the state-of-the-art text data augmentation methods. The additional analysis demonstrates that the data augmentations generated by AWD are interpretable and can flexibly extend to new examples without further training.

machine learning, natural language, text classification, (14 more...)

2305.09287

Country:

North America > United States > Connecticut (0.04)
Asia > China > Beijing > Beijing (0.04)
North America > Canada > Ontario > National Capital Region > Ottawa (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Neural Information Processing SystemsApr-6-2023, 18:38:30 GMT

Active Learning with Statistical Models

An active learning problem is one where the learner has the ability or need to influence or select its own training data. Many problems of great practical interest allow active learning, and many even require it. We consider the problem of actively learning a mapping X - Y based on a set of training examples {(Xi,Yi)} l' where Xi E X and Yi E Y. The learner is allowed to iteratively select new inputs x (possibly from a constrained set), observe the resulting output y, and incorporate the new examples (x, y) into its training set. The primary question of active learning is how to choose which x to try next. There are many heuristics for choosing x based on intuition, including choosing places where we don't have data, where we perform poorly [Linden and Weber, 1993], where we have low confidence [Thrun and Moller, 1992], where we expect it

learner, learning architecture, statistical model, (10 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.57)

Neural Information Processing SystemsApr-6-2023, 16:38:22 GMT

Discriminative Direction for Kernel Classifiers

Once a classifier is estimated from the training data, it can be used to label new examples, and in many application domains, such as character recognition, text classification and oth- ers, this constitutes the final goal of the learning stage. The statistical learning algorithms are also used in scientific studies to detect and analyze differences between the two classes when the correct answer'' is unknown, and the information we have on the differences is represented implicitly by the training set. Example applications include morphologi- cal analysis of anatomical organs (comparing organ shape in patients vs. normal controls), molecular design (identifying complex molecules that satisfy certain requirements), etc. In such applications, interpretation of the resulting classifier in terms of the original feature vectors can provide an insight into the nature of the differences detected by the learning algorithm and is therefore a crucial step in the analysis. Furthermore, we would argue that studying the spatial structure of the data captured by the classification function is important in any application, as it leads to a better understanding of the data and can potentially help in improving the technique.

application, classifier, discriminative direction, (8 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.57)

#artificialintelligenceMar-16-2023, 21:45:56 GMT

PiEEG

PiEEG is an open source Raspberry Pi shield that measures biosignals such as those used in electroencephalography (EEG), electromyography (EMG), and electrocardiography (ECG). PiEEG is versatile, easy to work with, and compatible with different types of electrodes. Best of all, it was designed to be usable by anyone. To begin measuring bio-signals, all you need to do is connect the electrodes and run a Python script. Applications include gaming, entertainment, sports, health, meditation, and more.

crowd supply, data processing, pieeg, (6 more...)

#artificialintelligence

Industry:

Health & Medicine > Diagnostic Medicine (0.94)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.58)
Health & Medicine > Therapeutic Area > Neurology (0.38)

Technology:

Information Technology > Software (0.40)
Information Technology > Artificial Intelligence (0.34)

#artificialintelligenceMar-4-2023, 12:55:17 GMT

Data Augmentation: Transforming Your Training Data from Meh to Marvelous

Today, we're going to talk about one of my favorite topics: Data Augmentation. Yes, I know, it may sound a bit dry and technical at first, but trust me, this is one of the most exciting and creative aspects of deep learning. So buckle up and let's dive in! First things first, let's define our terms. Data augmentation is a technique used in deep learning to increase the amount of training data by creating new examples from the existing ones.

augmentation, data augmentation, transformation, (12 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.82)